Television messaging vocal response generation

ABSTRACT

In a messaging system adapted to operate in a television environment, the invention teaches a method and apparatus for allowing a user to enter textual outgoing message, which is translated to speech by text to speech module. The speech is than delivered to a target server such as a voice mail or e-mail server, for listening by the message recipient.

FILED OF THE INVENTION

[0001] The present invention relates generally to messaging systems andmore specifically to providing vocal response to messages without theneed to record respondent voice.

BACKGROUND OF THE INVENTION

[0002] Television messaging has been described in documents such as WO00/44173 to the present inventor. In the same document a system thatprovides for recording a respondent message was also described. The userrecords a response, which is captured and then transmitted to theoriginator of the message. The system may be used for generation, aswell as for responding to, messages. Different methods of messaging havebeen described by Different methods of messaging have been described bythe like of U.S. Pat. No. 5,844,552 to Gaughan et al, Hsu et al in WO98/56188, by WO 00/44173 to Bartfeld—the present inventor, and byothers. WO 00/44173 has other applications pending in the US and othercountries around the world. Television based messaging is a growingtrend that offers ready access to messaging emanating from differentsources, utilizing the television wide bandwidth and ready availabilityto provide easy access to the electronic messaging domain.

[0003] Recording the user voice is not always the best approach. Itrequires a voice capture device, i.e. a microphone, to be placed eitherin proximity to the television or the set-top box, or in proximity tothe user. Placing the microphone in proximity to the TV or the set-topbox gives rise to problems of environmental noise, and users are oftenmoving around. Additionally, often the user is reluctant to record hisvoice due to privacy issues, or not wanting to disturb others.

[0004] While the internet and other digital communications means allowthe delivery of text messages, voice based messaging systems such asvoice mail, allow access to the message using commonly available devicesuch as telephones. By eliminating the need for a data reading device,the messages become accessible remotely or to users that do not havecomputers handy.

[0005] The translation of text to speech (TXTS hereinafter) is wellknown in the art. It is commonly used in such systems as banking andother automated response systems, as well as for use by people withspeech or site disabilities. It is mainly used to translate data frommachine to a user, not from user to user, as generally ample speechtransmission medium such a telephone, voice mail, or simply speaking ataudible range, are abundant, readily available and well known. It is notcommonly used for messaging as data messaging is directed to dataenabled devices.

[0006] It is therefore advantageous to provide a mechanism to transformdata messages, entered via keyboards and the like, to voice for deliveryto voice enabled services. The present invention is directed to a methodand device for doing so.

SUMMARY OF THE INVENTION

[0007] In its most general form, the invention comprises a text entrydevice such as a keyboard or a keypad (e.g. a Television Remote Controlkeypad), a text capture software module, coupled to a TXTS module, andtransmission capacity of the TXTS module output to a messaging servercapable of delivering voice messages.

[0008] In one preferred embodiment, the invention operates inconjunction with a television messaging system (TMS hereinafter). TheTMS is adapted to deliver messages to a subscriber TV via a set-top box.A set-top box is a terminal adapted to receive television transmissionsthrough cable, radio transmission, satellite, data network such as DSLor ISDN, or any other form of transmitting television information. Theset-top box decodes the general television transmission, or the messagessent by the TMS, and presents it on the subscriber television. The userresponds to a message using a text entry device such as a keyboard or aremote control, coupled to the set-top box. A software text to speechmodule (TXTS module) in the set top box translates the text to voice.The output of the TXTS may be produced as a data stream or file, or asaudible voice. The output of the TTS module is transmitted to amessaging system adapted to provide voice messaging, known hereinafteras a target messaging system, or alternatively as a voice mail system.The transmission may be directly to the target messaging system, or viaan intermediate server. While it is preferred to transmit the TTS outputas a data file, it may also be transmi9tted as a voice by providing atelephone connection to the voice mail system and producing speech inaudible form.

[0009] Preferably, the user is allowed to review, and more preferably toedit his message.

[0010] In another embodiment of the invention, the TXTS module residesin a server remote to the user premises (Local Messaging Server or LMS).In such case, input may be provided via a keyboard, or alternatively viaa keypad in communication with the server, such as a telephone orcellular phone keypad. Communications may be performed via the set-topbox, e.g. with a keyboard connected thereto. In all implementations ofthe invention, an upstream network should be provided to transmit theuser message, in text, voice, or data format, to the target messagingsystem.

[0011] The TXTS functionality may be divided between the set-top box andthe LMS. For example keystrokes may be translated and encoded as voice,in the set-top box, and the output transferred to the LMS for storage ordelivery to the voice mail system.

[0012] Thus in a preferred embodiment, the invention provides for a textto voice messaging device, operating in conjunction with a televisionmessaging system, having messaging software, the system comprises aset-top box in communication with the television messaging system, andadapted to deliver a message through a television coupled thereto. Atext-receiving module executed in the set-top box, and adapted toreceive text from a user. A text to speech module coupled to said textreceiving module for transforming said text into speech, said text tospeech module adapted to produce a voice output corresponding to saidtext. A voice delivery module is coupled to the text to speech module,and adapted to deliver said output to a target messaging system capableof receiving voice messages.

[0013] The text to voice messaging device of may further comprising atext entry device to deliver user typed text to said text receivingmodule.

[0014] In one preferred embodiment, the output delivery module isadapted to transmit said output to the target voice messaging system ina voice data file format. In a more preferred embodiment, the outputdelivery module is adapted to transmit said output to the target voicemessaging system in a speech format.

[0015] Preferably, the set-top box is adapted to be coupled o an IPnetwork and deliver said output therethrough.

[0016] In the preferred embodiment, the target messaging system is aunified messaging system.

[0017] Alternatively, the text to voice messaging server, operates inconjunction with a television messaging system for delivering messagesto a user, the server comprising a text receiving module, adapted toreceive text input from a user, and a text to speech module executed onsaid server, and coupled to said text receiving module for transformingsaid text into speech. The text to speech module adapted to produce avoice output corresponding to said text. A voice delivery module adaptedto deliver said output to a target voice messaging system is coupled tothe text to speech module.

[0018] The text to voice messaging server may be adapted to receive saidtext input via an upstream network selected from a group consisting of atelevision distribution network, a telephone network, a cellularnetwork, a wireless network, a wired network, a satellite network, aterrestrial network, a DSL network, a data network or a combinationthereof.

[0019] The output delivery module may be adapted to transmit said outputto the target voice messaging system in a voice data file format, or totransmit said output to the target voice messaging system in a speechformat.

[0020] The target messaging server may be integrated into saidtelevision messaging server.

[0021] The server functionality described above may be distributedbetween a server and a set top box, for example, wherein the textreceiving module is operating on the set-top box, while the text tospeech may be implemented on the server.

[0022] The invention also describes a voice to text messaging systemoperating in conjunction with a television messaging system having atelevision messaging system, the voice to text messaging systemcomprising a server located remotely to a user premises, said serveradapted to deliver messaging to a television via a downstream network,and a set top box coupled to the downstream network. A text entry devicein communications with said set top box, is provided for text entry by auser. The text entry device is coupled to a speech to text moduleadapted to produce output representative said text in speech format, anda voice delivery module adapted to deliver said output to a targetmessaging server adapted to receive voice messages is coupled to thetext to speech module.

[0023] In another aspect, the present invention provides a messagingmethod comprising the steps of: outputting a message to a user using atelevision, receiving a response message from a user, said responsemessage comprising text, transforming said text into an output in aspeech format, and delivering said output to a messaging server adaptedto receive voice messages.

[0024] The step of transforming can be carried out by a set-top boxcoupled to said television. The set top box may be coupled to a datanetwork, e.g. an internet, wherein said step of delivering is performedvia said data network. Alternatively, step of delivering is performed byfeeding said signals to a telephone network.

[0025] Alternatively, a server remote to said television carries out thestep of transforming.

[0026] The speech output can be in the form of a file containing datarepresenting said speech, or the output may comprise electrical signalsrepresenting said speech.

[0027] It will be clear to a person skilled in the art that theinvention may be carried out by a computer program operating on ageneral purpose, or even specially adapted computer. It is thereforeclear that the invention further covers such computer, that whenexecuted by a computer, will cause the computer to operate substantiallyin accordance with the methods described above, or adapt the computer tooperate substantially in accordance with the systems described above. Inthis instance, the word computer program relates to the code itself, themedia on which it is carried, or its transmission. It further extends tothe compiled or un-compiled form, in the form of source code, objectcode, code intermediate sources (such as in a partially compiled form),or in any other form suitable for use in the implementation of theinvention. Computer programs may be standalone applications, softwarecomponents or plug-ins to other applications. Computer programs may beembodied on a carrier, being any entity or device capable of carryingthe computer program: for example, a storage medium such as ROM or RAM,optical recording media such as CD-ROM or magnetic recording media suchas floppy discs or hard disks. The carrier may be a transmissiblecarrier such as an electrical or optical signal conveyed by electricalor optical cable, or by radio or other means. Computer programs may beprovided for download across the Internet from a server. Computerprograms may also be embedded in an integrated circuit.

SHORT DESCRIPTION OF DRAWINGS

[0028]FIG. 1 describes a complete system of one embodiment of thepresent invention

[0029]FIG. 2 is a block diagram of modules comprising an embodiment ofthe invention.

[0030]FIG. 3 is a block diagram of another embodiment of the inventionshowing in context the invention implemented by a set-top box.

DETAILED DESCRIPTION

[0031] Referring now to FIGS. 1 and 2, a local messaging server 120 iscoupled to a television transmission system 122, which is in turncoupled to a television distribution system 140. Television distributionsystem, also referred to as the ‘downstream network’, may be any mediumadapted to carry television transmissions, for example a cable TVnetwork, a radio or other wireless network, a DSL network, a terrestrialor satellite based network, and other similar media. The downstreamnetwork is adapted to deliver the signal to a set-top box 110. Theset-top box may be any terminal device adapted to receive the messageand display it on a television 1 15 coupled thereto. In the preferredembodiment depicted in FIG. 1, the set top box is also coupled to akeyboard 130, or any other convenient text entry device, such as adedicated keypad.

[0032] When the user wishes to respond to a message, one option open tohim/her is to type the response via the keyboard 130. A text receivingmodule 205, preferably operated on the set-top box 110, captures thetext and transfers the text to a TXTS module 210. The TXTS module may beimplemented in software or in hardware, and may operate on the set-topbox, or on a server. In FIG. 1, the server is integrated into LMS 120,but it will apparent to those skilled in the art that more than oneserer may be deployed and the TXTS may reside on any server, preferablyremote to the user premises.

[0033] If the TXTS module operates on a server, a method of transferringthe text input is required. Such transfer occurs via an upstreamnetwork. The upstream network may comprise any convenient means capableof text data transfer, such as by telephone 185, using an uplink in thedistribution network 140, as provided by many cable networks, a wirelessor wired network, a cellular network, or a combination thereof.

[0034] Alternatively, the keyboard or data entry device 130 may becoupled to an upstream network, bypassing the set-top box. Either thetext entry device or the set-top box may be coupled to an upstreamnetwork different from the downstream network 140, or preferably, if thedistribution network is adapted to transfer data upstream as well asdownstream, via the distribution network 140.

[0035] The TXTS module produces an output, preferably in the form of adata file such as a WAV file. Wav files are a popular file format thatcontains data directly translated to sound. Other formats, e.g. MP3, andsimilar voice file formats may off course be used as well. Such file istransferred to the target messaging system 125 as a data file by suchmethods as the internet 175, a PSTN, 170, or network comprising eitherLAN or wan, and other methods which are a matter of technical selection.Alternatively, the TXTS may be implemented on the LMS 125. The transferof the TXTS output to the target messaging system is accomplished by avoice delivery module 215.

[0036] In an alternative embodiment, the TXTS module output is in avoice format, i.e. either an electrical stream representing the speech,or in a bit stream that may be directly decoded as such. In such a case,either the TXTS 210 or the voice delivery module, are adapted to dialthe target messaging system 125, and automatically deliver the voiceoutput thereto.

[0037] The target messaging system may be integrated into the LMS or tothe same server on which the TXTS is running., or may be any other voiceenabled system, ranging from a simple answering machine, to a cellularnetwork, a unified messaging server, a voice mail server, a telephonenetwork messaging system, and the like.

[0038] Addressing information may be provided directly by an incomingmessage, or preferably by an addressing module 220. The addressingmodule provides the voice delivery module 215 with the target address towhich the message should be sent. The addressing module may extract theinformation from an incoming message, receive indication from the user,(possibly guiding the user by messages via the television messagingsystem) or obtain information form a database 225 coupled thereto. Anexample is if the e-mail, name, or similar addressing information of therecipient is known, the database may contain records containing a voicemailbox associated with the recipient.

[0039]FIG. 3 shows an embodiment where the invention operates primarilywithin the set-top box 110. This embodiment shows yet anotherembodiment, where an incoming message decoding module 230 is utilized todecode an incoming message and extract a reply address therefrom.Alternatively, the user may enter the address via text receiving module205. Also noted in this embodiment is that the upstream networkinterface 235, may be coupled to any number of upstream networks, fordelivering the TXTS output. By way of a non-limiting example, a PSTN,Internet, and the distribution, or downstream network may be utilized.

[0040] Yet another method of delivering the message to the target systemwhen the TXTS module resides in the set-top box is carried out byconnecting the set top box with the PSTN network. The voice signal maybe delivered directly to the target messaging system 125 via optionaltelephony link 185 and 190. As described above, a similar method may beemployed in a server based TXTS module and voice delivery module.Clearly, the word telephony network relates to any network adapted totransfer at least voice, such as PSTN, ISDN, etc.

[0041] It will be appreciated that the invention is not limited to whathas been described hereinabove merely by way of example. While therehave been described what are at present considered to be the preferredembodiments of this invention, it will be obvious to those skilled inthe art that various other embodiments, changes, and modifications maybe made therein without departing from the spirit or scope of thisinvention and that it is, therefore, aimed to cover all such changes andmodifications as fall within the true spirit and scope of the invention,for which letters patent is applied.

What is claimed is:
 1. A text to voice messaging device, operating inconjunction with a television messaging system, having messagingsoftware, the system comprising: a set-top box in communication with thetelevision messaging system, and adapted to deliver a message through atelevision coupled thereto; a text receiving module executed in theset-top box, and adapted to receive text from a user; a text to speechmodule coupled to said text receiving module for transforming said textinto speech, said text to speech module adapted to produce a voiceoutput corresponding to said text; and, a voice delivery module adaptedto deliver said output to a target messaging system capable of receivingvoice messages.
 2. The text to voice messaging device of claim 1 furthercomprising a text entry device to deliver user typed text to said textreceiving module.
 3. The text to voice messaging device of claim 1wherein said output delivery module is adapted to transmit said outputto the target voice messaging system in a voice data file format.
 4. Thetext to voice messaging device of claim 1 further wherein said outputdelivery module is adapted to transmit said output to the target voicemessaging system in a speech format.
 5. The text to voice messagingdevice of claim 1 wherein said set-top box is adapted to be coupled onan IP network and deliver said output therethrough.
 6. the text to voicedevice wherein said target messaging system is a unified messagingsystem.
 7. A text to voice messaging server, operating in conjunctionwith a television messaging system for delivering messages to a user,the server comprising: a text receiving module, adapted to receive textinput from a text entry device; a text to speech module executed on saidserver, and coupled to said text receiving module for transforming saidtext into speech, said text to speech module adapted to produce a voiceoutput corresponding to said text; and, a voice delivery module adaptedto deliver said output to a target voice messaging system.
 8. The textto voice messaging server of claim 7 further adapted to receive saidtext input via an upstream network selected from a group consisting of atelevision distribution network, a telephone network, a cellularnetwork, a wireless network, a wired network, a satellite network, aterrestrial network, a DSL network, a data network or a combinationthereof.
 9. The text to voice messaging server of claim 7 wherein saidoutput delivery module is adapted to transmit said output to the targetvoice messaging system in a voice data file format.
 10. The text tovoice messaging server of claim 7 further wherein said output deliverymodule is adapted to transmit said output to the target voice messagingsystem in a speech format.
 11. The text to speech messaging server ofclaim 7 wherein said target messaging server is integrated into saidtelevision messaging server.
 12. The text to speech messaging server ofclaim 7 adapted to communicate with a text entry device via an upstreamnetwork.
 13. The text to speech server of claim 12, wherein saidupstream network is selected from a group comprising a telephonynetwork, a cellular network, a wireless network, a televisiondistribution network, a DSL network, and ISDN network, a cabletelevision network, an internet, or a combination thereof.
 14. The textto speech server of claim 7, adapted to communicate with a text entrydevice coupled thereto via a set-top box.
 15. A voice to text messagingsystem operating in conjunction with a television messaging systemhaving a television messaging system, the voice to text messaging systemcomprising: a server located remotely to a user premises, said serveradapted to deliver messaging to a television via a downstream network; aset top box coupled to said downstream network; a text entry device incommunications with said set top box, for text entry by a user; a speechto text module adapted to produce output representative said text inspeech format; and, a voice delivery module adapted to deliver saidoutput to a target messaging server adapted to receive voice messages.16. a messaging method comprising the steps of: outputting a message toa user using a television; receive a response message from a user, saidresponse message comprising text; transforming said text into an outputin a speech format; and, delivering said output to a messaging serveradapted to receive voice messages.
 17. The method of claim 16 whereinsaid step of transforming is carried out by a set-top box coupled tosaid television.
 18. the method of claim 17 wherein said set top box iscoupled to a data network and wherein said step of delivering isperformed via said data network.
 19. The method of claim 18 wherein saiddata network is an Internet.
 20. The method of claim 16 wherein saidstep of transforming is carried out by a server remote to saidtelevision.
 21. The method of claim 16 wherein said output is in theform of a file containing data representing said speech.
 22. The methodof claim 16 wherein said output comprises electrical signalsrepresenting said speech.
 23. The method of claim 22 wherein said stepof delivering is performed by feeding said signals to a telephonenetwork.
 24. A computer program that when executed by a computer, willcause the computer to operate substantially similarly to the voice totext messaging server of claim
 7. 25. A computer program that whenexecuted by a computer, will cause the computer to operate substantiallythe steps of the method of claim
 16. 26. A set-top box operating inconjunction with a television messaging system and adapted to deliver amessage through a television coupled thereto, the set-top boxcomprising: a text receiving module executed in the set-top box, coupledto a keyboard for receiving text from a user; a text to speech modulecoupled to said text receiving module for transforming said text intospeech, said text to speech module adapted to produce a voice outputcorresponding to said text; a voice delivery module adapted to deliversaid output to a target messaging system capable of receiving voicemessages; and, an upstream network interface capable of delivering saidoutput.